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Total Variation Minimization Based Compressive Wideband Spectrum Sensing for 

Cognitive Radios 

Yipeng Liu and Qun Wan 
Abstract 

Wideband spectrum sensing is a critical component of a functioning cognitive radio sys- 
tem. Its major challenge is the too high sampling rate requirement. Compressive sensing (CS) 
promises to be able to deal with it. Nearly all the current CS based compressive wideband 
spectrum sensing methods exploit only the frequency sparsity to perform. Motivated by the 
achievement of a fast and robust detection of the wideband spectrum change, total variation 
minimization is incorporated to exploit the temporal and frequency structure information to 
enhance the sparse level. As a sparser vector is obtained, the spectrum sensing period would 
be shorten and sensing accuracy would be enhanced. Both theoretical evaluation and numerical 
experiments can demonstrate the performance improvement. 

Index Terms 

cognitive radio, dynamic spectrum access, wideband spectrum sensing, compressive sensing, 
total variation minimization. 

I. Introduction 

Windband spectrum sensing is a critical component of a functioning cognitive radio system [1J. To 
simultaneously grab any available channel for secondary systems while avoiding harmful interference to 
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primary receivers, cognitive radios should be able to sense a wide range of frequencies to find "blank 
spaces" in the licensed bands which are not being used by primary users at a particular place and time 
0. 

There are three ways to perform the wideband spectrum sensing: One approach utilizes a bank of fixed 
narrowband bandpass filters. But it requires an enormous number of Radio Frequency (RF) components 
and bandpass filters, which lead to high cost. Besides, the number of the bands is fixed and the filter range 
is always preset. Thus the filter bank way is not flexible. Another approach use a tunable bandpass filter 
(BPF). Based on a one-by-one search strategy, a BPF can be employed to search one frequency band at a 
time. However, this approach results in a significantly long search time 0. The third one is to implement 
a wideband circuit using a single RF chain followed by high-speed DSP to process the full range of the 
available spectrum at once H). And the major challenge is the too high sampling rate requirement. 

Compressive sensing (CS) promises to be able to deal with the high sampling rate problem [|4]| j6l j5]|. 
To exploit the frequency sparsity of the received wideband signal, CS is a novel sampling technique that 
can sample sparse signals with sub-Nyquist rates and performs accurate recovery. The survey shows that 
the fixed spectrum allocation leads to a low spectrum utilization ratio by the primary radios Q, which 
is also the reason that cognitive radio can use the "blank spectrum holes" to settle the spectrum scarcity 
problem. Furthermore, the low spectrum utilization also says that the wiedeband signal is sparse in a wide 
range of frequency domain with large probability. 

CS has turned out to be an effective method to address the wideband spectrum sensing. A pioneering 
work was done in (81 [0 IfTOl , where CS is exploited for wideband spectrum sensing and the standard 
11 norm minimization is used to estimate the original spectrum. Other different types of implementation 
structures for compressive wideband spectrum sensing have been proposed in IfTTI IPT21 ITOI lfT31 [[T5l . 
However, nearly all these methods exploit only the frequency sparsity to perform compressive wideband 
spectrum sensing. 

In this paper, the temporal and frequency structure information is jointly taken to incorporate in the 
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compressive wideband spectrum sensing via total variation minimization technique. In the wide monitoring 
frequency band, though the nonzero frequencies are sparse globally, yet in some local subbands, the 
nonzero entries may be distributed densely 0. Therefore, a differential operation on the power spectrum 
state vector would result in a sparser one. Besides, It is typically that the spectrum occupancy states 
change slowly in time 0. And when the initial spectrum states is obtained, it is not the spectrum state 
but the change of the spectrum states that controls the cognitive radios to perform the dynamical secondary 
spectrum access. The gradient operation on the spectrum state vectors at two adjacent detection periods 
can give the the spectrum state change vector which can give sufficient information to enable dynamical 
spectrum access. Furthermore, as the primary signal is slow-varying, the spectrum state change vector 
would be sparser than the spectrum state vector. To achieve the differential operation in the joint temporal 
and frequency domain, total variation [fT6ll , which can be interpreted as the 11 norm of the (appropriately 
discretized) gradient is minimized here. As the sparser vector is used, the spectrum sensing performance 
would be enhanced. 

In the rest of the paper, section II gives the sparse spectrum signal model; In section III, the previous 
standard compressive wideband spectrum recovery method is described; section IV develops the total vari- 
ation minimization based compressive wideband spectrum sensing scheme; Section V briefly compares the 
performance; In section VI, numerical experiments is taken to demonstrate the performance enhancement 
of the proposed method; Finally Section VI draws the conclusion. 

II. Signal Model 

Suppose that there exists several primary users within a wide frequency range. The entire frequency 
band is segmented into non-overlapping subbands. Primary users employ Frequency Division Multiplexing 
with fixed channelization. The locations of these subbands are assumed to be preset and known. But the 
corresponding power spectral density (PSD) levels are slowly varying in different spectrum sensing periods. 

Different types of implementation structures for random sampling for compressive wideband spectrum 
sensing have been proposed in IfTTTl lfT8ll |fT9ll GUI . The operation to obtain the random samples is usually 
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called analog-to-information convertor (AIC). Briefly an AIC may be conceptually viewed as an ADC 
operating at Nyquist rate, followed by compressive sampling. Here the AIC is taken to sample the analog 
baseband signal x(t). For illustration convenience, we detail the algorithm in discrete setting as it did in 
|S1 H5| HDI HQ Q^l (131 H51- Denote the N x 1 stacked vector at the output of the ADC by 



XtN XtN+l " " ' %+l)W-l 



t = 0, 1, ••• , T-l 



(1) 



where the time window for the ADC is set to be the Nyquist sampling duration to recover the signal 
without aliasing. The corresponding random samples of AIC is: 



jtM ytM+i • • • y(t+i)M-\ 



, t = 0, 1, ••• , T-l 



(2) 



With the M xN random measurement matrix tD, we can formulate the random sampling as: 



(3) 



As only PSD is needed to be estimated here, the phase information of the samples are not necessary. 
The PSD vector can be obtained directly from the autocorrelation of the samples. Here the signal is 
assumed to be zero-mean, wide-sense stationary. The autocorrelation at lag j are r x (j) = E [x„x*_ ; j and 
r y (j) = £ [y„y*_ 7 ]. The autocorrelation vectors of the Nyquist samples CD and the compressive samples 
© are 



r r , = 



r x , t (-N + l) ■■■ r x M ■•■ r XJ (N-\) 



(4) 



and 



r >,t = 



r w (-M+l) 



r y , t (M-l) 



(5) 
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respectively. In practice, estimates of the autocorrelation are obtained by averaging over several signal 
segments. Here it can be easily get: 



R y = E [y*yf ] = OR s (D fl 



(6) 



where 



[R x ] i j = r x (i-j) = r;(j-i) 



(7) 



and 



(8) 



Then the relation between the Nyquist autocorrelation vector (NAV) © and the compressive autocorre- 
lation vector (CAV) © can be formulated H3]| : 



r y = Ar x 



where 



(9) 



A = 



Denoting the (i, j)-th element of <D,j by <p*j. The M x N matrix has it entry as: 



(10) 



And the N x N matrices cD l5 <T> 2 , 3 and 4 are: 



i=l,j=l,...,N 
M+ 2-jj i*l,j = !,••', N 



(11) 



Oi = hankell [0/vxi] , 



cp 



i.i 



(12) 
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<D 2 = hankel 



+1 



1JV 



Kn 



lx(iV-l) 



(13) 



3 = toeplitz \[0 Nx i] 



<pl,N ■ ■ ■ 01,2 



(14) 



<D 4 = teoplitz 



1.1 



0i,i 



lx(iV-l) 



(15) 



where hankel(a, b) is a hankel matrix (i.e., symmetric and constant across the anti-diagonals) whose first 
column is a and whose last row is b; and toeplitz(a, b) is a toeplitz matrix (i.e., symmetric and constant 
across the diagonals) whose first column is a and whose first row is b. 

To monitor a broad band, high sampling rate is needed. It is often very expensive. Besides, too many 
sampling measurements inevitably increase the storage space and computational burden for (DSP), while 
spectrum sensing requires a fast and accurate algorithm. CS provides an alternative to the well-known 
Shannon sampling theory. CS is a framework performing non-adaptive measurement of the informative 
part of the signal directly on condition that the signal is sparse. According to the FCC report 0, the 
allocated spectrum is in a very low utilization ratio. That means the sparsity inherently exists in the 
wideband spectrum. As x f has a sparse representation in frequency domain: 



r x = (16) 

where *F is the IN x IN inverse discrete Fourier transform (DFT) matrix, and p is the 2N x 1 PSD vector. 
And Most of the entries of p are zero or almost zero. When the number of nonzero elements of p is S 
(S «; AO, the signal is said to be 5-sparse. 

Here an M xN random projection matrix <D = S is used to sample signals, i.e. y, = Sx f , where M < N 
and S is a non-Uniform Subsampling or Random Subsampling matrix which are generated by choosing 
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M separate rows uniformly at random from the unit matrix l N . Combining © and (fl6l) , the measurement 
samples can be represented as 



r vf = ATp, = Dp, (17) 
where D is the 2M x IN dictionary for the compressive sparse representation. 

III. The Standard Compressive Wideband Spectrum Sensing 

CS theory asserts that, if a signal has a sparse representation in a certain space, one can use the random 
measurement to obtain the samples and reconstruct it with overwhelming probability by optimization on 
condition that the random measurement matrix satisfies the restricted isometry property (RIP) which is 
a condition on matrices *D which provides a guarantee on the performance of <D in CS. It can be stated 
asflU 01: 

(l-^)||y^<||Oy|g<(l + ^)||y||2 (18) 

or all the 5-sparse y. where ||y|| 2 = |y,-| 2 ) / is the norm of the vector y = [yi,y2, • • • ,)>m] T - The 
restricted isometry constant 6 S e (0, 1) is defined as the smallest constant for which this property holds for 
all 5-sparse vectors y. There are three kinds of frequently used measurement matrices, and the Random 
Subsampling matrix S which is chosen here is one of the three. 

When the RIP holds, a series of recovering algorithm can reconstruct the sparse signal. One is greedy 
algorithm, such as matched pursuit (MP) fl2T|, orthogonal matched pursuit (OMP) 11221 ; another group is 
convex program, such as basis pursuit (BP) $Z5$ UM, LASSO [|25l fl26]| Wl BH and Dantzig Selector 
(DS) [29]. Comparing these algorithms, DS has almost the same performance as LASSO. Both of these 
groups has the advantages and disadvantages. Briefly it can be summarized that convex program algorithm 
has a more reconstruction accuracy while greedy algorithm has less computing complex. 
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To find the unoccupied spectrum for the secondary access, the signal in the monitoring band is down- 
converted to baseband and sampling the resulting analogue signal through an AIC that produces samples 
at a rate below the Nyquist rate. Afterwards, the autocorrelation of the samples is acquired. 

Now the PSD of y(t) is estimated from the autocorrelation. With the random measurement matrix <D = S 
and the random measurement vector y t t = 0, 1, • • • , T , the dictionary D can be the autocorrelation 
r y f and the CWSS can be formulated as 

p = argmin||p|| 

p (19) 

s.ty, = Dp 

where ||p|| is the ^ norm which counts the number of nonzero entries of the vector p. The minimization 
of the norm is the best candidate to encourage the sparse distribution. However, it is not a convex 
function and the formulated CWSS (PT91 is not a convex programming. 

To convert the CWSS model to a convex programming, the norm is relaxing to the ^ norm. Further 
more, to enhance the noise suppression performance, LASSO is taken to do the CWSS. It is a shrinkage 
and selection method for linear regression. It minimizes the usual sum of squared errors, with a bound on 
the sum of the absolute values of the coefficients. To get more accuracy, we can reformulate the CWSS 
based on LASSO as 

Plasso = argmin||p||i 

p (20) 

s.t. ||y f - Dp|| 2 < n 

where \x is a parameter bounding the amount of noise in the data, and ||p| | x = \Pi\ is the ^ norm of the 
vector p = [p(l), P(2), • • • , p(2N)] T . This problem is a second order cone program (SOCP) and can therefore 
be solved efficiently using standard software packages. It finds the smallest ^ norm of coefficients among 
all the decompositions that the signal is decomposed into a superposition of dictionary elements. It is a 
decomposition principle based on a true global optimization. A number of convex optimization software, 
such as cvx 11301 , SeDuMi [31 1 and Yalmip ||32l can be used to solve the problem. 
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IV. The Proposed Compressive Wideband Spectrum Sensing 



The standard sparse recovery method based CWSS only exploits the sparsity in frequency domain. To 
further enhance the sensing performance, other structure information can be incorporated. One is that 
the nonzero frequencies are in clusters [H, and the PSD level in every cluster is almost similar. The 
differential of the PSD sequence would result in a sparser sequence with most of the nonzero elements 
locating at the band boundaries. The other structure information is that the similarity of two PSD vectors 
in adjacent sensing periods is very high as the primary signal is often slow- varying. The differential of 
the adjacent PSD vector would also result in a sparser vector than the original PSD vectors. 

To integrate the differential operations in both temporal and frequency domain, the total variation, which 
can be interpreted as the ^ norm of the gradient, can be minimized. In CS, the number of measurements 
required to successfully recover signal is in proportion to the sparsity degree 10. Hence, the differential 
operation can reduce the number of measurements required, especially when there are wideband signals 
in the sensing range. Combining sparsity of PSD and differential PSD, we can obtain an total variation 
minimization (TVM) based CWSS. 

A. The TVM Based Compressive Wideband Spectrum Sensing 

Assume for each spectrum sensing period, there are T measurement vectors. After the autocorrelation 
operation, we formulate them into a matrix and get that: 



R 



■2MxT ~ r-yj r-y.2 



(21) 



PU P',2 



(22) 



Thus we can reformulate (fTTT) in matrix as: 



R-2Mxr - ^2Mx2N~P2NxT 



(23) 
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Then we can let: 



*2TMX1 = Vec(R2Mx7-) 



PuNxi - vec (P2Nxt) 



B 



2TMX2TN - T>2Mx2N ® ^-TxT 



Thus we can get: 



r 2TMxi — ^2TMx2TN\>2TNx\ 



The total variation can be regarded as 2-dimensional differential as: 



PV = £ 



\P(i,j)-P(i-l,j)\ + \P(i,j)-P(i,. 
+ \P(i,j)-P(i+l,j)\ + \P(i,j)-P( 



Here the PV can be reformulated as: 



PV - ||V 8r jv x 2rArP27Wxllll 



where 



&TNX2TN — 



(Vl)2TNx2TN 
(^2)2TNx2TN 
(^3)2TNx2TN 
(Sa)2TNx2TN 
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1 







V! = 



-110 



0-110 











(31) 











0-110 



0-11 



v 2 = 



v 4 = 



-li,i o 
-ha 





ll,2iV+l 
1 2,2^+2 



-1 



2TN,2N+l 







V, = 



1-10 



0-110 

















1-10 
1-1 
1 



li.i o 

1 2 ,2 





- ll,2iV+l 

-12.2W+2 













1 



1.2AT+1 



-1 



2TN,2TN 



Based on the above TVM formulation, the CWSS can be formulated as: 



(32) 



(33) 



(34) 
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Pt-vmi = argmin||Vp|| 



p 



s.t. ||Y - BP|| 2 < n 



(35) 



p = vec (P) 



In TVM based CWSS (TVM-CWSS) (1351) . a more sparser vector is estimated. Thus the sensing 
performance would be enhanced. 

There are T segments of measurement vectors in a sensing period. However, when there are only one 
measurement vector at current time, the TVM can exploit the sparsity in the differential between the 
estimated PSD vector and the last time's PSD. Accordingly, with T = 2, the TVM-CWSS (f35]) would 
give the estimated wideband spectrum at current time. 

From multiple estimated PSD vectors, the spectrum holes can be located. Also the starting time for of 
primary radios' vacancy can be noticed. Experience tells us that for longer time that the spectrum has 
been in vacancy, the spectrum would be more likely to be unused. 

B. Performance Evaluation 

To Compare the TVM-CWSS and LASSO-CWSS, the required number for successful recovery of the 
sparse signal is analyzed briefly. We assume that there is no measurement noise or perturbation and let 
n = 2N. With overwhelming probability, a single piece of PSD vector can be successfully recovered for 
the convex programming for sparse signal recovery on condition that (61: 



where C is some constant. 

Suppose that the number of nonzero entries in each column of the estimated PSD matrix R is S. Thus 
the total number of measurement for successful LASSO-CWSS (f20l) would be: 



m>CS log (n/S) 



(36) 



m BP > TCS log (n/S) 



(37) 
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To analyze the performance of TVM-CWSS (1351), we assume that for each the average cluster size is d, 
and the number of clusters in a single PSD vector is K (K «: S). Thus we have S = Kd. For a single PSD 
vector recovery, the TVM-CWSS reduce the number of nonzero entries in the estimated sparse vector to 
be 2K. Therefore, the number of measurement for a vector recovery via TVM would be: 

m>KC log (n/(2K)) (38) 

Besides, As a slow-varying signal, only a small part of the positions of the nonzero entries would changed. 
The variation includes both the newly emerging nonzero entries and the disappeared entries. Here we 
assume the number of variation in nonzero elements' positions is A (A «: S). Apart from the first recovery 
of the first column when t = 0, the required measurement number for each PSD column recovery is: 

m > AClog(n/(A)) (39) 

Thus, the total number of measurement for successful TVM-CWSS (1331 ) would be: 

m TVM > C ((T - 1) A log (n/(A)) + K log (n/(2K))) (40) 

As A <sc S and K <sc S , the minimum number of measurements for successful TVM-CWSS would be 
much less than LASSO-CWSS'. 

V. Simulation Results 

Numerical experiments are presented to illustrate performance of the proposed TVM-CWSS for cog- 
nitive radio. Here we consider the initial primary base band signal with its frequency range from 0Hz 
to 500MHz as Fig. \T\ shows. The primary signal with random phase are contaminated by a zero-mean 
additive Gaussian white noise (AGWN) which makes the signal noise ratio (SNR) be lOdB. Five active 
block dense bands are randomly located at eight subbands 46MHz - 50MHz, 56MHz - 60MHz, 141MHz 
- 150MHz, 161MHz - 170MHz, 231-260MHz, 381MHz - 400MHz, 421MHz-425MHz, and 441MHz - 
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445MHz. The obtained wideband signal has about 14.0% nonzero entries, which is consistent to the result 
of practical surveys. Normalizing the energy of the active bands in the wideband, One example of the 
PSD levels can be given out as Fig. Q] shows. Here we take the noisy signal as the received signal x(t). 
As CS theory suggests, we sample x(t) randomly at the sub-Nyquist sampling rate via AIC. The resulted 
sub-sample vector is denoted as y t . Here only one measurement vector are available at current time, and 
the TVM-CWSS exploits the spars ity in the differential between the estimated PSD vector and the last 
times PSD. 

To make contrast, with the same number of samples and fixed nonzero entries in the signal model, 
power spectrum with different methods with 1000 tries averaged are given out in Fig. [2] and Fig. [3] The 
compressive measurement is obtained at 0.25% of the Nyquist rate. The parameter /u is chosen to be 
0.05 ||y,|| 2 - They show that the proposed TVM-CWSS gives better reconstruction performance. To make 
details, it shows that there are too many fake spectrum points in the subbands without active primary 
signals presented in Fig [2] for the standard LASSO. The noise levels of result from LASSO-CWSS 
are quite high along the whole monitoring band; and the real active frequencies can hardly distinguish 
themselves. However, in Fig. [H the active frequencies clearly show up; the noise levels in the inactive 
bands are quite low. Besides, at other sub-sampling rate, the proposed TVM-CWSS also outperforms the 
standard LASSO-CWSS in the same way for wideband spectrum sensing. It demonstrates that exploiting 
the differential frequency sparsity enhances the sensing performance. 

To further evaluate the performance, Monte Carlo simulations are used to test the performance. The 
false alarm ratio vector can be defined as: 



Pf = 



Pf,\ Pf,2 



Pfq 



(41) 



where Q is number of the divided subbands, and for the q-th subband, 



TM(H hq \H , q ) 



(42) 



PF,q = 



L 
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where L is the times of Monte Carlo Simulations and TM(Hi q \H 0q \ counts the times when the q-th 
subband is in vain but the estimation gives the result that it is occupied by primary radio. Similarly, the 
detection ratio vector can be defined as: 



Pd = 

where for the q-th subband, 



Pd,\ Pd,2 • • • Pd,q 



r 

(43) 



TM (H Uq \H lq ) 

Po, q = K — (44) 

where T M (H\ >q \H\ >q ^ counts the times when the estimation correctly gives the result that the q-th sbuband 
is occupied by primary radio. 

For everyone of the L=1000 simuluations, five subbands are randomly chosen from the eight ones to 
be active as it shows before. Fig. |4] illustrates the false alarm performance of the 5-th subband (150MHz 
- 230MHz) via the measurement number for these two CWSS, where the false alarm holds when at least 
one of the elements in the 5-th subband is larger than one of the elements in the active bands. And Fig. 
[5] shows the detection ratio performance of the 6-th subband (231MHz - 260MHz) via the measurement 
number, where the detection holds when all elements in 6-th subband are larger than all of the elements 
in the inactive bands. The sub-sampling percentage to the Nyquist sampling is from 0.2 to 0.8. Obviously 
we can see the TVM-CWSS (HU) has a remarkable performance gain against LASSO-CWSS ([20]) for both 
the false alarm ratio and detection ratio performance. 

VI. Conclusion 

This paper addresses the wideband spectrum sensing for cognitive radio. Because the wideband primary 
signal often has the nonzero frequencies in cluster, the frequency differential would result in a sparser 
vector; and because the primary signal is often slow-varying, the differential of the PSD vectors in adjacent 
sensing period would also give out a sparser vector. Here the total variation minimization is used to perform 
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the joint temporal-frequency differential for compressive wideband spectrum sensing. Both performance 
analysis and stimulations demonstrate that the TVM-CWSS has performance enhancement. 
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Fig. 1. The normalized spectrum of noiseless active primary signals in the monitoring band. 
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Fig. 2. The normalized estimated spectrum via LASSO-CWSS in the monitoring band. 




Fig. 3. The normalized estimated spectrum via TVM-CWSS in the monitoring band. 




Subsampling ratio 



Fig. 4. 



The false alarm performance in the 5-th subband (150MHz - 230MHz) with different sub-sampling rate. 
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Fig. 5. The correct detection performance in the 6-th subband (231MHz - 260MHz) with different sub-sampling rate. 



